AITopics | loss component

Collaborating Authors

loss component

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Scalable Decision-Focused Learning through Cost-Sensitive Regression

Schutte, Noah, Berden, Senne, Guns, Tias, Postek, Krzysztof, Yorke-Smith, Neil

arXiv.org Machine LearningMay-19-2026

Many real-world combinatorial problems involve uncertain parameters, which can be predicted given contextual features and historical data. These `predict-then-optimize' or `contextual optimization' problems have gained significant attention: end-to-end training methods can now minimize the downstream task cost rather than the predictive error. However, despite their effectiveness, these decision-focused learning (DFL) approaches often rely on repeated solving of the underlying combinatorial optimization problem during training, making them computationally expensive and difficult to scale. We reframe the learning problem as a cost-sensitive multi-output regression problem: multi-output due to the combinatorial problem having multiple uncertain parameters, and cost-sensitive due to the downstream task cost being the real target. Our technical contribution is the formalization of multiple loss function components that follow from this reframing: cost-insensitive normalization, decision-aware asymmetric penalization of over- and underpredictions, and instance-based costs that mimic the true downstream task-based loss locally. These components require zero or one solve per training data instance, while requiring no further solves during training. Experiments show that the combination of loss components achieves comparable downstream task quality to the state of the art, while being significantly more efficient, enabling scaling to problem sizes that have not been tackled before with DFL.

artificial intelligence, instance-based cost, machine learning, (19 more...)

arXiv.org Machine Learning

2605.18005

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback

4ca82782c5372a547c104929f03fe7a9-Supplemental.pdf

Neural Information Processing SystemsFeb-8-2026, 13:54:53 GMT

loss component, skeptical student, student, (14 more...)

Neural Information Processing Systems

Country: North America > United States > California > Los Angeles County > Los Angeles (0.32)

Industry: Education (0.76)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.54)

Add feedback

Uncertainty-Resilient Multimodal Learning via Consistency-Guided Cross-Modal Transfer

Jang, Hyo-Jeong

arXiv.org Artificial IntelligenceNov-21-2025

Multimodal learning systems often face substantial uncertainty due to noisy data, low-quality labels, and heterogeneous modality characteristics. These issues become especially critical in human-computer interaction settings, where data quality, semantic reliability, and annotation consistency vary across users and recording conditions. This thesis tackles these challenges by exploring uncertainty-resilient multimodal learning through consistency-guided cross-modal transfer. The central idea is to use cross-modal semantic consistency as a basis for robust representation learning. By projecting heterogeneous modalities into a shared latent space, the proposed framework mitigates modality gaps and uncovers structural relations that support uncertainty estimation and stable feature learning. Building on this foundation, the thesis investigates strategies to enhance semantic robustness, improve data efficiency, and reduce the impact of noise and imperfect supervision without relying on large, high-quality annotations. Experiments on multimodal affect-recognition benchmarks demonstrate that consistency-guided cross-modal transfer significantly improves model stability, discriminative ability, and robustness to noisy or incomplete supervision. Latent space analyses further show that the framework captures reliable cross-modal structure even under challenging conditions. Overall, this thesis offers a unified perspective on resilient multimodal learning by integrating uncertainty modeling, semantic alignment, and data-efficient supervision, providing practical insights for developing reliable and adaptive brain-computer interface systems.

artificial intelligence, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2511.15741

Genre: Research Report (1.00)

Industry:

Health & Medicine (0.68)
Education (0.47)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Cognitive Science (0.90)
(2 more...)

Add feedback

LLavaCode: Compressed Code Representations for Retrieval-Augmented Code Generation

Cherniuk, Daria, Sukhorukov, Nikita, Sushko, Nikita, Gusak, Daniil, Sivtsov, Danil, Tutubalina, Elena, Frolov, Evgeny

arXiv.org Artificial IntelligenceOct-23-2025

Retrieval-augmented generation has emerged as one of the most effective approaches for code completion, particularly when context from a surrounding repository is essential. However, incorporating context significantly extends sequence length, leading to slower inference - a critical limitation for interactive settings such as IDEs. In this work, we introduce LlavaCode, a framework that compresses code into compact, semantically rich representations interpretable by code LLM, enhancing generation quality while reducing the retrieved context to only a few compressed single-token vectors. Using a small projector module we can significantly increase the EM and ES metrics of coding model with negligible latency increase. Our experiments demonstrate that compressed context enables 20-38% reduction in Time-to-First-Token (TTFT) on line completion tasks compared to full-RAG pipelines.

large language model, machine learning, qwen2, (20 more...)

arXiv.org Artificial Intelligence

2510.19644

Country: Europe > Russia (0.15)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.88)

Add feedback

Supplementary Material: Dynamic Prompt Learning: Addressing Cross-Attention Leakage for T ext-Based Image Editing

Neural Information Processing SystemsOct-10-2025, 23:14:13 GMT

The input prompts, like "a cat and a dog" guide the search process to retrieve images

artificial intelligence, editing, machine learning, (13 more...)

Neural Information Processing Systems

Country: Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.05)

Industry: Media > Photography (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Impact of Loss Weight and Model Complexity on Physics-Informed Neural Networks for Computational Fluid Dynamics

Chou, Yi En, Liu, Te Hsin, Lin, Chao-An

arXiv.org Artificial IntelligenceOct-1-2025

Physics Informed Neural Networks offer a mesh free framework for solving PDEs but are highly sensitive to loss weight selection. We propose two dimensional analysis based weighting schemes, one based on quantifiable terms, and another also incorporating unquantifiable terms for more balanced training. Benchmarks on heat conduction, convection diffusion, and lid driven cavity flows show that the second scheme consistently improves stability and accuracy over equal weighting. Notably, in high Peclet number convection diffusion, where traditional solvers fail, PINNs with our scheme achieve stable, accurate predictions, highlighting their robustness and generalizability in CFD problems.

artificial intelligence, machine learning, pinn, (17 more...)

arXiv.org Artificial Intelligence

2509.21393

Country: Asia (0.46)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Stabilizing Humanoid Robot Trajectory Generation via Physics-Informed Learning and Control-Informed Steering

D'Elia, Evelyn, Viceconte, Paolo Maria, Rapetti, Lorenzo, Ferigo, Diego, Romualdi, Giulio, L'Erario, Giuseppe, Camoriano, Raffaello, Pucci, Daniele

arXiv.org Artificial IntelligenceSep-30-2025

Recent trends in humanoid robot control have successfully employed imitation learning to enable the learned generation of smooth, human-like trajectories from human data. While these approaches make more realistic motions possible, they are limited by the amount of available motion data, and do not incorporate prior knowledge about the physical laws governing the system and its interactions with the environment. Thus they may violate such laws, leading to divergent trajectories and sliding contacts which limit real-world stability. We address such limitations via a two-pronged learning strategy which leverages the known physics of the system and fundamental control principles. First, we encode physics priors during supervised imitation learning to promote trajectory feasibility. Second, we minimize drift at inference time by applying a proportional-integral controller directly to the generated output state. We validate our method on various locomotion behaviors for the ergoCub humanoid robot, where a physics-informed loss encourages zero contact foot velocity. Our experiments demonstrate that the proposed approach is compatible with multiple controllers on a real robot and significantly improves the accuracy and physical constraint conformity of generated trajectories.

artificial intelligence, machine learning, trajectory, (20 more...)

arXiv.org Artificial Intelligence

2509.24697

Country: Europe > Italy (0.28)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Model-Based Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Robots > Humanoid Robots (0.93)

Add feedback

a267f936e54d7c10a2bb70dbe6ad7a89-Supplemental.pdf

Neural Information Processing SystemsAug-16-2025, 12:54:20 GMT

artificial intelligence, machine learning, qualitative result, (19 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.47)

Add feedback

Filters

Collaborating Authors

loss component

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

Scalable Decision-Focused Learning through Cost-Sensitive Regression

4ca82782c5372a547c104929f03fe7a9-Supplemental.pdf

5321b1dabcd2be188d796c21b733e8c7-Supplemental-Conference.pdf

4ca82782c5372a547c104929f03fe7a9-Supplemental.pdf

Uncertainty-Resilient Multimodal Learning via Consistency-Guided Cross-Modal Transfer

LLavaCode: Compressed Code Representations for Retrieval-Augmented Code Generation

Supplementary Material: Dynamic Prompt Learning: Addressing Cross-Attention Leakage for T ext-Based Image Editing

Impact of Loss Weight and Model Complexity on Physics-Informed Neural Networks for Computational Fluid Dynamics

Stabilizing Humanoid Robot Trajectory Generation via Physics-Informed Learning and Control-Informed Steering

a267f936e54d7c10a2bb70dbe6ad7a89-Supplemental.pdf